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Abstract 

Many  descriptions  of  algorithms  in  computational  geometry  exclude  degeneracies  by  fiat.  Prac- 
titic  ners  are  left  to  their  own  devices  for  dealing  with  degeneracies  when  implementing  such 
algr^rithms.  Since  degeneracies  tend  to  be  numerous  and  hard  to  enumerate  exhaustively,  this 
is  often  a  reason  for  not  implementing  theoretical  algorithms.  This  paper  proposes  a  powerful 
symbolic  scheme  for  treating  degeneracies.  Our  method  is  simple  to  use,  and  is  applicable  for 
a  wide  variety  of  problems  in  computational  geometry  (in  particular,  whenever  random  pertur- 
bations are  applicable).  Our  method  is  deterministic  but  is  as  efficient  as  probabilistic  schemes. 
Illustrations,  limitations  and  wider  issues  are  discussed. 
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1      Introduction 

The  theoretical  study  of  algorithm  in  computational  geometry  is  an  active  area.  Besides  the 
inherent  beauty  arising  from  the  interplay  of  geometrical  and  algorithmic  properties,  this  area 
holds  the  promise  of  impact  on  important  application  areas  such  as  robotics,  graphics  and  VLSI. 
Unfortunately,  the  reduction  of  theoretical  algorithms  to  practice  has  been  relatively  slow  and 
this  has  often  been  commented  on.  In  our  view,  two  fundamental  issues  must  be  addressed  in 
order  to  speed  up  this  'technology  transfer'. 


• 


• 


Fixed  precision  arithmetic.  Theoretical  algorithms  assume  an  exact  (arbitrary  precision) 
model  of  numerical  computation.  Its  implications  in  the  world  of  fixed  precision  computa- 
tions are  not  fully  understood. 

Data  degeneracy.  Theoretical  algorithms  are  often  described  for  the  'non-degenerate'  cases 
of  the  inputs.  Sometimes,  even  careful  attempts  at  capturing  all  degenerate  cases  leave 
hard-to-detect  gaps. 

Both  problems  are  a  deep  source  of  frustration  for  practitioners  who  often  find  mysterious  fail- 
ures in  their  algorithms.  We  firmly  believe  that  theoreticians  are  justified  in  making  these  two 
assumptions  as  long  as  their  goal  is  the  understanding  of  the  global,  combinatorial  structure  of 
problems.  However,  it  would  not  justify  a  continuing  neglect  of  both  these  issues.  Both  raise 
extremely  interesting  questions  in  their  own  right.  Partly  because  of  such  neglect,  there  seems 
to  be  a  credibility  gap  between  theoreticians  and  implementors:  the  latter  often  view  theoretical 
algorithms  with  suspicion. 

Theoreticians  have  begun  to  address  these  questions.  For  instance,  the  recent  paper  [11], 
addresses  the  fixed  precision  issue  in  the  context  of  computer  graphics.  It  is  important  to  realize 
that  although  fixed  precision  arithmetic  and  date  degeneracy  are  related,  they  are  distinct  issues. 
In  this  paper  we  deal  with  the  latter.  The  symmetry  breaking  rules  in  simplex  algorithms 
(e.g.  [2])  are  the  precursors  of  symbolic  treatment  of  data  degeneracies.  In  computational 
geometry,  Edelsbrunner  and  his  students  are  among  the  first  to  publish  solutions  to  the  problem 
of  degeneracies  [6,8,7]  (see  chapter  9.4,  [9]).  Our  independently  discovered  scheme  wiU  turn  out 
to  be  a  generalization  and  simplification  of  their  method. 

Degeneracy  in  computational  geometry  is  a  general  phenomenon.  So  in  what  sense  can  we 
justify  its  neglect  in  theoretical  algorithms?  One  justification  is  that  explicit  handling  of  degen- 
eracies obscures  the  centrality  of  the  non-degenerate  cases:  degenerate  cases  normally  involve 
an  overwhelming  number  of  cases  that  are  disproportionate  to  their  likelihood  of  occurrence. 
But  an  implementor  of  these  algorithms  must  handle  the  degeneracies  when  they  do  arise.  Most 
authors  are  correct  in  suggesting  that  a  random  perturbation  of  data  will  remove  degeneracies 
with  high  probability.  Such  a  suggestion  is  justified  if  the  problem  is  stable,  as  the  case  generally 
turns  out  to  be.  However,  an  explicit  demonstration  that  the  problem  is  stable  is  seldom  done. 
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In  contrast  to  the  suggestion  of  random  perturbation,  it  is  occasionally  suggested  that  the  al- 
gorithmic description  ought  to  carefully  work  out  all  the  degenerate  cases.  We  take  the  opposite 
position  that  this  is,  in  general,  inadvisable:  it  is  neither  illuminating  for  a  global  understanding 
of  the  algorithm  nor  is  it  in  the  interest  of  the  implementor  who  would  then  have  to  implement 
the  numerous  degenerate  cases.  Besides,  the  increase  in  the  number  of  cases  may  lead  to  other 
programming  errors  or  incomplete  theoretical  analysis.  The  approach  advocated  in  this  paper 
is  to  work  out  some  general  scheme  to  achieve  suitable  data  perturbation.  In  this  way,  theo- 
reticians can  continue  to  focus  on  the  interesting  non-degenerate  cases  while  implementors  can 
enjoy  the  benefits  of  algorithms  that  have  few  cases  and  yet  can  handle  all  conceivable  inputs. 
(Paraphrased:  implementors  can  now  join  the  theoretical  paradise  in  which  degeneracies  are 
abolished.)  Thus  the  main  contribution  of  this  paper  is  a  practical  and  general  scheme  achieving 
these  goals. 

Overview.  The  rest  of  the  paper  is  organized  as  follows.  Section  2  explores  the  meaning  of 
degeneracy  and  related  concepts.  Section  3  reviews  probabilistic  perturbation  schemes  and  points 
out  unsatisfactory  properties.  To  illustrate  the  symbolic  approach,  we  outline  Edelsbrunner's 
method  in  section  4.  The  new  scheme  appears  in  section  5.  In  section  6,  we  explore  the  concept 
of  ordered  rings  implied  by  our  scheme.  Section  7  investigates  the  computational  and  complexity 
questions  arising  in  implementing  our  scheme.  We  conclude  with  some  directions  for  future  work 
in  section  8. 

2     What  is  geometric  data  degeneracy? 

How  can  a  general  solution  as  proposed  in  the  introduction  be  achieved?  We  first  need  an 
understanding  of  what  we  mean  by  degeneracies.  In  this  paper,  we  assume  that  the  problem 
input  is  a  sequence  of  real  numbers  called  the  input  parameters  a  =  (oi, . .  .,a„).  An  input  is 
degenerate  if  some  polynomial  Pj{xi, . . . ,  Xkj)  in  a  fixed  set 

D  =  {pj{xi,...,Xk^)  :  j  =  1,2,..., m  and  kj  >  1),    (m  >  1) 

evaluates  to  zero  when  an  allowable  substitution  for  the  variables  (ij, . . . ,  x^j)  is  made  using  the 
o,'s.  Here  the  set  D,  called  the  test  polynomials,  depends  only  on  the  algorithm  and  not  on  the 
inputs.  Although  £>  is  a  finite  set  here  (the  usual  case  in  practice),  our  method  works  as  well 
if  D  were  infinite.  The  'allowable  substitution'  of  variables  in  each  test  polynomial  by  input 
parameters  is  dictated  by  the  problem.  We  refrain  from  making  this  more  formal  but  this  will 
be  easy  to  do  (for  each  particular  case)  once  the  following  examples  are  understood. 

Examples  of  degeneracy.  When  input  a  represents  a  set  of  points  in  the  Euclidean  plane, 
degeneracies  include  (i)  two  coincident  points,  (ii)  three  coUinear  points,  or  (iii)  four  cocircular 
points.  If  the  input  represents  a  set  of  lines,  common  notions  of  degeneracy  include  (iv)  a  vertical 
line,  (v)  two  paraUel  lines,  (vi)  two  perpendicular  lines,  or  (vii)  three  concurrent  lines.  If  the  input 


2     WHAT  IS  GEOMETRIC  DATA  DEGENERACY?  3 

represents  points  and  lines,  degeneracy  may  include  (viii)  a  point  lying  in  a  line,  or  (ix)  a  line 
parallel  to  the  line  through  two  points.  This  list  can  go  on.  The  'allowable  substitution'  above 
may  only  amount  to  typing  the  variables  in  D  and  the  input  parameters  so  that  substitutions 
must  respect  the  type  distinctions.  For  instance,  some  variables  and  parameters  correspond  to 
the  first  coordinate  of  points  and  others  correspond  to  the  slope  of  lines,  etc.  More  concretely, 
suppose  the  input  parameters  are  01,61,02,62, ..  .,a„,6n  representing  n  points  in  the  plane,  and 
let  the  set  of  test  polynomials  consists  of  only  one  polynomial, 

A(xi,  2/1, 12,2/2,2^3,2/3) 

which  is  just  the  3  by  3  determinant  that  tests  if  the  points  (a:i,?/i),(a'2, 2/2)  and  (13,2/3)  are 
collinear  (cf.  section  4).  Allowable  substitution  here  means  that  (1)  the  substitution  of  the 
input  parameters  must  be  in  pairs  (i.e.  for  all  ij,  (a,-,  6,)  must  be  substituted  simultaneously 
for  (xj,2/_,)),  and  that  (2)  the  three  input  points  to  be  substituted  must  have  distinct  subscripts. 

Exact  model  of  numerical  computation.  It  is  important  to  realize  that  this  paper 
assumes  the  exact  model  of  numerical  computations.  All  numbers  are  represented  exactly:  for 
example,  to  represent  any  real  algebraic  number  a  exactly,  it  suffices  to  specify  a  polynomial 
p{x)  with  integer  coefficients  together  with  an  interval  /  containing  a  but  no  other  distinct  roots 
oi  p{x).  The  end  points  of/  are  exact,  say  represented  by  rational  numbers.  Our  development  is 
not  restricted  to  any  particular  exact  representation.  Note  that  since  the  integers  involved  (as  in 
the  coefficients  of  p(x),  or  in  representing  /)  can  be  arbitrarily  long,  we  sometimes  call  this  the 
arbitrary  precision  model  {ot,  somewhat  misleadingly,  'infinite  precision  model').  This  contrasts 
with  the  fixed  precision  world  of  the  numerical  analysts.  With  the  advent  of  computer  algebra, 
the  exact  models  are  becoming  more  important  and  indeed  unavoidable  for  some  applications. 

We  have  stated  that  the  issues  of  fixed  precision  and  data  degeneracy  are  distinct.  Indeed, 
any  attempt  to  consider  data  degeneracy  in  the  fixed  precision  models  faces  some  very  difficult 
problems:  for  instance,  there  are  many  puzzles  in  just  trying  to  define  what  it  means  for  three 
points  to  be  collinear  in  the  fixed  precision  model  (actually,  the  world  of  pixels).  In  any  Ccise, 
one  should  begin  by  understanding  degeneracy  in  exact  models. 

Degree  of  derivation  and  derived  degeneracies.  In  the  course  of  executing  algorithms, 
derived  values  b  =  (61,  62, . . .)  may  be  generated  from  the  input  parameters  a.  Let  us  define  the 
'degree  of  derivation'  of  b  be  the  least  integer  d  >  1  such  that  for  each  6,-  in  b  there  is  {d+  1)- 
variate  polynomial  p{x,xi, . .  .,xj)  with  integer  coefficients,  whose  degree  in  each  variable  is  at 
most  d  such  that  if  each  Xj  {j  =  l,...,d)  is  substituted  by  suitable  values  7j  from  a,  then  6, 
is  a  root  of  p(x,7i, . .  .,7d).  For  instance,  if  the  input  represents  points,  then  b  may  be  the 
computed  distances  between  pairs  of  points,  and  the  degree  of  derivation  is  2.  If  this  degree  d 
of  derivation  does  not  grow  with  the  input  size  n,  we  say  that  the  algorithm  has  bounded  degree 
of  derivation.  A  problem  is  of  bounded  degree  if  it  has  an  algorithm  with  bounded  degree  of 
derivation.  Examples  of  bounded  degree  problems  include  computing  the  convex  hulls  or  the 
Voronoi  diagrams.  Examples  of  problems  with  unbounded  degree  include  shortest  path  problems 
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and  root  isolation  (or  more  generally,  cell  decomposition).  Our  method  to  be  described  does  not 
allow  the  substitution  of  derived  values  with  derivation  degree  more  than  1  (this  is  really  the 
same  as  allowing  substitution  by  input  parameters  only),  but  see  section  8  for  discussion. 

Our  definition  of  degeneracy  above  can  be  generalized  thus:  we  say  that  the  input  parameters 
have  derived  degeneracies  if  substitution  by  derived  values  into  the  test  polynomials  of  D  results 
in  a  zero  value. 

Alternative  notions  of  degeneracy.  In  a  provocative  discussion,  [10],  Render  and  Freuden- 
stein  point  out  that  there  are  several,  not  necessarily  mutually-consistent,  notions  of  degeneracy 
in  the  literature.  Render  and  Freudenstein  propose  to  unify  these  disparate  notions  by  describ- 
ing degeneracies  to  be  a  'system  relative'  concept.  Of  course,  this  is  a  very  general  formulation 
and  our  particular  notion  here  is  'system  relative'  to  the  extend  that  the  particular  set  of  test 
polynomials  D  depend  on  the  algorithm. 

Degeneracies  are  often  depicted  as  rare  events.  In  computer  vision,  there  are  notions  of 
degeneracy  that  belie  this  view,  as  shown  by  this  example  from  [10].  Imagine  a  very  squat 
pyramid  and  a  view  of  the  pyramid  from  'below'.  This  view  shows  only  the  bcise  of  the  pyramid 
and  would  be  considered  a  'degenerate  view'  in  certain  contexts  of  vision  research.  Yet,  this 
bottom  view  is  hardly  'rare'  by  any  reasonable  definition  (in  the  sense  of  geometric  measures). 
Perhaps  it  is  better  to  caU  such  views  'deficient'  (since  they  give  inadequate  information)  rather 
than  degenerate. 

Perhaps  a  more  pertinent  illustration  which  suggests  that  the  'degeneracies  as  rare  events' 
view  require  careful  interpretation  is  this:  when  dealing  with  highly  structured  scenes  or  robot 
environments,  certain  'degeneracies'  such  as  parallel  lines  or  collinear  points  are  features  rather 
than  accidents  of  the  input  space.  For  instance,  in  descriptions  of  a  robot  environment  in  a 
factory,  we  expect  parallel  lines  to  be  a  common  feature.  The  explanation  lies  in  realizing  that 
we  normally  assume  that  the  input  space  (for  a  fixed  input  size  of  n)  is  the  set  of  aU  n-tuples 
of  (exactly  representable)  real  numbers.  It  may  happen  that  the  input  space  is  a  proper  subset 
of  all  possible  72-tuples  (usually,  a  submanifold);  'rarity  of  degeneracies'  must  be  relative  to  this 
manifold.  However,  our  method  as  it  stands  assumes  the  full  input  space. 

Problem  stability.  The  fundamental  assumption  in  this  paper  is  that  with  the  'rare  event' 
view  of  degeneracies,  we  can  perturb  away  the  degeneracy.  A  crucial  but  often  implicit  require- 
ment is  that  the  problem  at  hand  defines  a  function  from  the  input  space  to  the  output  space  that 
is  'continuous'  in  this  sense:  a  solution  to  a  perturbed  version  of  the  input  data  is  a  reasonable 
approximate  solution  for  the  original  data.  Then  we  are  indeed  justified  in  using  a  solution  to  the 
perturbed  input  as  the  final  output.  We  caU  such  problems  stable.  Since  continuity  arguments 
are  involved,  stability  is  relative  to  the  choice  of  topologies  on  the  spaces  concerned. 

Example  of  instability/stability.  Consider  the  problem  of  constructing  the  Voronoi  dia- 
gram of  a  set  of  planar  points  (sites).  We  would  like  to  show  that  this  problem  is  stable.  Let  us 
assume  that  certain  four  cocircular  sites  cause  the  Voronoi  diagram  to  have  a  Voronoi  vertex  of 
degree  4.  Any  perturbation  of  the  input  that  removes  the  cocircularity  of  these  four  sites  causes 
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the  said  \  oronoi  vertex  to  split  into  two  very  close  Voronoi  vertices  of  degree  3  each.  There 
are  two  combinatorially  distinct  ways  in  which  this  split  can  occur.  Clearly  the  combinatorial 
structure  of  the  perturbed  Voronoi  diagram  is  different  from  the  original  Voronoi  diagram.  So,  in 
the  combinatorial  sense,  the  problem  would  not  seem  stable.  Another  attempt  at  trying  to  show 
that  the  problem  is  stable  is  this:  use  the  Hausdorff  metric  on  closed  point  sets.  Unfortunately 
a  small  perturbation  may  introduce  an  infinite  Hausdorff  distance  between  the  two  Voronoi  di- 
agrams. [Consider  the  diagram  of  two  points  p  =  (-1,0)  and  q  =  (  +  1,0)  and  then  perturb  one 
of  the  points  top'  =  {6  -  1,0)  for  all  <5  >  0.] 

Now  suppose  our  goal  is  to  use  this  Voronoi  diagram  to  compute  an  obstacle-avoiding  path 
for  a  unit  disc  between  two  specified  points  P  and  Q,  viewing  these  sites  as  obstacles.  (It  follows 
from  [15],  that  to  find  such  a  path,  it  is  sufficient  to  look  for  one  path  in  which  the  center  of 
the  disc  lies  in  the  Voronoi  diagram.)  Let  the  set  of  n  sites  be  represented  by  a  £  f^  (where 
d  =  2n  and  £"''  is  the  Euclidean  d-dimensional  space).  It  is  natural  to  measure  the  'connection 
width'  between  any  two  points  P  and  Q  in  the  plane  by  the  quantity  C(P,  Q;a)  >  0  defined 
as  the  maximum  value  attained  by  the  clearance  of  some  path  from  P  to  Q  in  the  presence  of 
obstacles  defined  by  a.'.  Clearly,  if  C(P,  Q;a)  <  1  then  the  unit  disc  has  no  obstacle-avoiding 
path  connecting  P  and  Q.  Then  it  is  easy  to  show: 

Proposition  1  For  any  6  >  0  there  is  an  e  >  0  such  that  for  all  a,  b  £  S"^,  and  for  all  points  P 
andQ,  ifC{P,Q;&)  >  6  and  ||a-b||  <  e  then  \C{P,Q;a)  -  C{P,Q;h)\  <  S.  Here  ||a||  denotes 
the  Euclidean  norm. 

Such  a  stability  or  continuity  property  justifies  the  perturbation  of  input  a  in  this  appli- 
cation. This  illustrates  the  kind  of  justification  that  must  logically  precede  any  application  of 
perturbation  methods. 

Induced  and  inherent  degeneracy.  We  distinguish  between  inherent  degeneracy  of  the 
input  data  versus  an  algorithm-induced  degeneracy.  For  example,  if  the  input  represent  points  to 
a  convex  huU  algorithm,  it  is  apparent  that  if  three  consecutive  collinear  vertices  on  the  convex 
hull  ought  to  be  an  inherent  degeneracy  of  the  convex  huU  problem.  Now  if  the  algorithm 
uses  some  kind  of  vertical  partitioning  of  the  input  set  of  points,  or  uses  the  vertical  sweepline 
paradigm,  then  two  co-vertical  points  may  be  regarded  as  a  degeneracy.  These  degeneracies 
are  esisily  removed  in  this  case,  either  by  modifying  the  algorithm  or  by  perturbing  the  data 
deterministicaUy.  In  any  case,  they  are  degeneracies  induced  by  the  algorithm.  This  paper 
is  concerned  with  algorithm-induced  degeneracy.  It  seems  that  normally,  induced  degeneracies 
subsume  inherent  degeneracies. 


^The  clearance  of  a  point  i  is  its  distance  from  the  closest  site;  the  clearance  of  a  path  is  the  minimum  clearance 
among  points  along  the  path. 
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3  Probabilistic  schemes 

In  this  paper,  we  assume  a  computational  model  with  the  following  property.  There  is  a  fixed 
set  D  of  polynomials,  independent  of  the  input  such  that 

(*)  The  algorithm  only  makes  decision  steps  based  on  the  sign  of  polynomials  p(x)  G 
D  evaluated  at  x  =  (ii, . . . ,  x^)  '■=  [Oji , .  . ,  Oj^)  =  b  where  each  b  is  an  (allowable) 
substitution  from  the  input  parameters  a  =  (fli,. .  .,an)-  The  algorithm  then  makes 
a  3-way  branch  depending  on  the  ^ign  of  p(b),  with  the  case  p(h)  =  0  considered  to 
be  degenerate. 

The  goal  is  to  devise  a  data-perturbation  method  so  that  the  algorithm  never  takes  a  degenerate 
branch.  It  is  instructive  to  first  review  probabilistic  methods  for  data  perturbation  often  alluded 
to  in  the  literature. 

Perhaps  the  simplest  solution  is  to  arbitrarily  choose  either  p(b)  <  0  or  p(b)  >  0  whenever 
p(b)  =  0  is  encountered.  This  method  suffers  from  the  problem  of  global  consistency:  how  can 
transitivity  (i.e.,  p(b)  >  q(b)  and  ^(b)  >  r(b)  implies  p(b)  >  r(b))  be  maintained  without 
expensive  bookkeeping? 

The  initial  perturbation  scheme  overcomes  the  consistency  problem  by  precedirg  the  entire 
computation  with  an  initial  random  perturbation  of  the  input  data.  This  perturbation  ought  to 
guarantee 

(1)  No  new  degeneracies  arise  as  a  result  of  the  perturbation. 

(2)  The  original  degeneracies  are  all  removed. 

Property  (1)  can  be  satisfied  by  computing  some  a  priori  upper  bound  on  the  size  of  the 
perturbation.  Part  (2)  seems  more  difficult  to  ensure.  Granted  that  with  very  high  probability 
no  degeneracy  remains,  the  issue  remains  as  to  what  the  algorithm  must  do  when  a  degeneracy 
does  arise?  The  simplest  recovery  is  to  restart  the  algorithm  with  another  random  perturbation. 
This,  in  principle,  can  repeat  indefinitely.  Thus,  although  the  probabilistic  overhead  complexity 
of  the  initial  perturbation  scheme  is  small,  its  deterministic  complexity  is  unbounded.  This  is 
unsatisfactory  from  a  theoretical  viewpoint. 

4  Simulation  of  Simplicity 

We  now  turn  to  symbolic  perturbation  schemes.  To  illustrate,  we  briefly  review  a  version  of 
a  scheme  (called  simulation  of  sinr^Iicity,  or  SoS  for  short)  described  in  [6,8,7,9].  The  setting 
scheme  is  normally  carried  out  in  the  setting  of  computing  hyperplane  arrangements  where  the 
test  polynomials  are  determinants.  For  illustration,  say  a  set  H  of  lines  in  the  plane  is  simple  if 
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(i)  no  three  lines  are  concurrent,  (ii)  no  two  lines  are  parallel,  (iii)  no  two  pairs  of  lines  intersect 
on  a  common  vertical  line,  and  (iv)  no  two  pairs  of  lines  intersect  on  a  line  parallel  to  a  line 
in  H.  Each  of  these  conditions  corresponds  to  the  vanishing  of  a  suitable  determinant  over  the 
input  parameters.  Let  a  line  h,  £  H  he  given  by  y  =  a.i  +  6,.  Now  replace  each  input  line  /i,  by 
/i,(e)  with  equation  y  —  a,{()x  +  6,(f)  where 

a,(e)  =  a,  +  e^" ,  b,{e)  =  b,  +  e^""' 

It  is  then  shown  that  if  e  >  0  is  sufficiently  small,  then  the  new  arrangement  n{e)  is  simple. 
However,  instead  of  substituting  actual  values  for  e,  we  carry  out  the  calculation  symbolically. 
For  example,  to  decide  if  the  intersection  of  h,{€)  n  /i_,(e)  lies  above  the  line  /ifc(e),  we  must 
determine  the  sign  of 

/  a,{e)     6.(e)     1  \ 
A(e)  =  det       c_,(e)     6_,(e)     1 

V  ak{e)    bk{e)     1  / 

It  is  then  observed  that  evaluating  the  sign  of  A(e)  amounts  to  evaluating  a  sequence  of  subde- 
terminants  of  the  original  matrix  until  the  first  non-zero  entry: 


AfO    = 


2ij-i 


det 


Ofc 


We  will  show  that  this  is  actually  a  general  phenomenon.  One  may  regard  each  input  pa- 
rameter c,-  to  be  perturbed  by  an  infinitesimal  amount  S,  {S,  =  ^  in  the  preceding  illustration). 
Furthermore  there  is  a  suitable  fixed  total  ordering  on  the  set  of  infinitesimals  and  their  products. 
For  instance,  without  loss  of  generality,  we  may  assume 

h  <  ^2  <  •  •  •  <  «5.  <  •  •  •  <  <5„. 

But  what,  for  instance,  is  the  relation  between  ^1^3  and  f>\1  In  the  next  section,  we  wiU  give 
a  systematic  framework  for  making  such  comparisons.  We  note  that  [7]  also  uses  a  different 
infinitesimal  for  each  variable,  but  they  did  not  have  to  give  a  general  rule  for  comparing  products 
of  infinitesimals  since  they  restricted  attention  to  evaluation  of  determinants  only. 

Another  remark  is  that  it  is  possible  to  justify  such  uses  of  infinitesimals  in  terms  of  non- 
standard analysis. 
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5      A  general  scheme  to  avoid  zeroes 

We  describe  a  procedure  to  evaluate  any  polynomial  p(x)  at  any  value  x:=a.  The  procedure 
outputs  the  value  p(a).  However,  in  case  p(x)  is  a  non-zero  polynomial  and  p(a)  =  0,  then  the 
output  is  one  of  two  types  of  zeroes:  0-  or  0+.  This  sign  information  will  be  globally  consistent 
in  a  natural  sense.  Such  a  procedure  can  be  used  as  a  black-box  by  any  algorithm  satisfying 
assumption  (*)  above  to  always  avoid  the  degenerate  branch  of  a  decision  step. 

Observe  that  this  scheme  also  gives  us  a  method  of  comparing  the  values  of  two  distinct 
polynomials  p,  q  at  any  fixed  point  x  =  a.  More  precisely,  if  the  sign  of  the  difference  polynomial 
p  —  p  at  X  =  a  is  positive  then  we  say  p(a)  >  9(a);  otherwise  p(a)  <  9(a).  Extending  this,  it 
mea  -  that  we  can  strictly  order  any  set  of  distinct  polynomials  {pi, . .  .,p,}  by  their  'values'  at 
any  poi:,t  x:=  a. 

As  in  [14],  let  PP  =  PP(2;i , . . . ,  a^n)  denote  the  set  of  all  power  products 


w 


=  n^r  (^.^0)- 


«=1 


Let  \w\  denote  Yl?-i  ^«-  ^  total  ordering  <  on  PP  is  admissible  if  for  any  w,w',w"  G  PP, 

A 

1.  l<u; 

A 

2.  w  <.w'  implies  ww"  <  w'w". 

A  A 

The  two  most  important  examples  of  admissible  orderings  are  the  total  degree  ordering, 

denoted     <   ,  and  the  (pure)  lexicographical  ordering,  denoted     <   .    Let  w,v  be  two  power 

TOT  LEX 

products  with  degree  vectors  (ei,e2, . .  .,£„)  «Lnd  {d\,d2,...,dn).    Then  we  define  k;    <    t;  if 

LEX 

XV  =  r  or  else  e,-  <  <f,  at  the  smallest  index  i  where  the  e,'s  and  d.'s  differ.  We  also  define  w   <   v 

TOT 

if  |ti;|  <  \v\  or  else  (|u;|  =  |v|  and)  w    <    v. 

LEX 

Henceforth  we  assume  some  arbitrary  but  fixed  admissible  ordering  <  on  PP. 

A 

For  any  w  =  x^  ■  •  -x^  and  polynomial  p(x),  x  =  (ii,. .  .,x„),  let  p^  refer  to  the  lu;!""  partial 
differential  of  p,  where  p  is  differentiated  ejt  times  with  respect  to  each  variable  x^  {k  =  1, . .  .,n). 
For  example,  if  u;  =  x'^y  then  p^  =  gfif--  Let  S{p)  denote  the  infinite  list  of  polynomials 

S{p)  =  {Pwo,Pwi,Pw2---)    ■ 

where  wo,wi,W2,-  ■  •  is  the  list  of  power  products  in  PP  in  increasing  <-order.    In  particular 

A 
Wo  =    I  and  Pu,o    =  P-     This  infinite  sequence  has  only  a  finite  number  of  non-zero  entries 

and  assuming  that  that  p  is  not  identically  zero,  the  last  non-zero  entry  is  a  constant.    Our 
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polynomial  evaluation  procedure  proceeds  as  follows:  given  a  non-zero  polynomial  p(x)  and  a 
point  a  =  {ai,. .  .,a„),  we  evaluate  successive  polynomials  in  S(p)  at  x:=a  until  the  first  non- 
zero entry;  by  above  remarks,  termination  of  this  procedure  is  guaranteed.  In  case  p(a)  7^  0,  then 
we  return  p(a);  otherwise  we  return  either  0-1-  or  0—,  where  the  sign  of  the  zero  is  that  of  the 
first  non-zero  entry  evaluated  in  the  sequence  S{p).  It  is  convenient  to  introduce  the  notation 

S{p;  a)  =  (Pu,o(a),  Pwi  (a),  Pu^^i^)  ■  •  •)• 

Len:  ma  2 

(a)  5(p;  a)  is  the  sequence  of  all  zeroes  iff  p  is  identically  zero. 

(b)  For  any  p  and  a,  the  polynomial  p  is  uniquely  determined  by  the  sequence  S{p;  a). 

Proof,  (a)  follows  from  our  above  remark  that  any  non-zero  polynomial  has  a  derivative  that 
is  a  non-zero  constant,  (b)  Suppose  that  p,q  are  two  polynomials  such  that  5(p;  a)  =  S{q;a). 
Then  S{p-  q\a)  =  0  (the  sequence  of  all  zeroes).  Then  part  (a)  implies  that  p  -  5  is  identically 
zero,  i.e.  p  =  q-  Q.E.D. 

This  proof  does  not  tell  us  how  to  reconstruct  the  polynomial  p  from  the  sequence  5(p;a) 
but  it  is  easy  to  give  a  formula  to  reconstruct  5(p;a)  from  5(px, ;a)  for  all  i  =  1, . .  .,n,  where 
Pj-,  ,  of  course,  is  partial  differentiation  with  respect  to  i,-. 

The  evaluation  procedure  amounts  to  a  function  that  assigns  a  sign  (zero,  positive  or  negative) 
to  every  polynomial,  where  each  non-zero  polynomial  is  either  positive  or  negative. 

We  define  p(x)  >  ^(x)  to  mean  that  the  polynomial  p(x)  —  g(x)  has  positive  sign  at  x:=  a. 

Corollary  3  For  all  polynomials  p  and  q,  exactly  one  of  the  following  relation  holds:  p  =  q  or 

p>  q  or  q>  p. 
a  a 

6      Ordered  Rings 

We  now  show  that  the  relation  >  is  a  total  ordering  and  has  other  algebraic  structure  as  well. 

There  is  a  beautiful  theory  of  ordered  fields  due  to  Artin  and  Schreier  [18].  One  sees  that  the 
definition  of  ordered  fields  in  [18]  only  uses  the  ring  properties  of  fields.  Accordingly,  we  may 
adapt  that  definition  to  rings.  Our  ring  R  wiU  be  assumed  to  be  commutative  with  a  unit  1. 

A  partially  ordered  ring  R  is  a.  ring  with  an  associated  sign  function  a  :  R  —^  {-1,0, -|-l} 
such  that  for  all  a,  6  6  R: 

Al  a{a)  =  —a(-a).  Ln  particular,  a{a)  =  o-(-a)  iflf  cT(a)  =  0. 

A2  a(ab)  =  (T(a)a{b) 

A3  (T{a)  >  0  and  a{b)  >  0  implies  a{a  -|-  6)  >  0,  with  equality  iff  a{a)  =  a{b)  =  0. 
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Call  a  ring  element  a  positive,  negative  or  nullary  according  to  the  sign  a{a)  of  a.  Note  that 
axiom  (Al)  implies  that  a(0)  =  0,  and  (A2)  implies  ct(1)  =  +1.  If  the  sign  function,  in  addition, 
satisfies  the  property 

A4  o{a)  =  0  if  and  only  if  a  =  0 

then  we  call  R  an  ordered  ring. 

We  define  a  relation  >  on  R  where 

a  >  6  if  and  only  if  (T{a  -  6)  =  +1. 

Also,  define  a  <  6  if  either  a  =  b  or  b  >  a.  We  get  the  expected  properties  as  in  [18]:  transitivity 
follows  from  axiom  (A3);  a>  b  implies  a  +  c  >  6  +  c,  and  if  c  is  positive,  then  ac  >  be  holds  as 
well.  And  a  >  6  if  and  only  if  b~^  >  a~^ .  Of  course,  <  is  a  partial  ordering  on  R,  and  if  R  is  an 

<T  <r  cr 

ordered  ring  then  <  becomes  a  total  ordering.  The  following  two  consequences  are  less  obvious: 

Lemma  4 

1.  In  a  partially  ordered  ring,  a  +  b>  a'  +  b'  implies  a>  a'  or  b>  b'. 

a  a  a 

2.  In  an  ordered  ring,  if  a  and  b  are  both  positive  and  ab  >  a'b',  then  a>  a'  or  b>  b'. 

a  a  a 

Proof.   1.   a{a  +  6  -  a'  -  6')  =  +1  implies  that  a{a  -  a')  =  +1  or  a{b  -  b')  -  +1  (otherwise 
cr(a'  -  a)  >  0  and  a{b'  -  b)  >  0  and  axiom  (A3)  implies  (r{a'  +  6'  -  a  -  6)  >  0,  contradiction). 
2.  If  a{a  —  a')  =  0  then  a  =  a'  (since  R  is  an  ordered  ring)  and  ab  >  a'b'  implies  ab  >  ab'.  This 

means  a{a{b  -  b'))  -  +1  and  since  a{a)  =  +1,  we  get  c{b  -  b')  =  +1,  proving  the  desired 
result.  Similarly  if  a{b  —  b')  =  0.  Therefore,  if  we  assume  the  result  is  false,  we  must  have 
a{a  -  a')  =  a{b  —  b')  =  -1.  We  get  a  contradiction  as  follows.  First,  these  assumptions  imply 
(T{{a'-a){b'-b))  =  +1.  It  follows  that  a{{a'-a){b' -b)  +  {ab-a'b'))  =  +1  Rearranging  terms,  we 
get  a{a{b-b')  +  b{a-a')  =  +1.  Using  (A3),  we  infer  that  c7(c(6-6'))  =  +1  or  cr(6(a-a'))  =  +1. 
If  a{a{b  -  b'))  =  +1  then  applying  (A2)  with  a{a)  =  +1,  we  get  ct(6  -  b')  =  +1  which  is  a 
contradiction.  A  similar  contradiction  arises  if  a(6(a  —  a'))  =  +1.  Q.E.D. 

The  main  examples  of  ordered  fields  are  the  rational  numbers  and  the  reals.  In  [18]  (exercises), 
it  is  pointed  out  that  we  can  also  order  the  set  of  univariate  polynomials  with  coefficients  over  an 
ordered  field  by  regarding  the  sign  of  a  polynomial  to  be  the  sign  of  the  leading  coefficient.  This 
can  be  generalized  to  multivariate  polynomials  once  we  have  fixed  an  admissible  ordering  on  the 
power  products:  then  we  may  speak  of  the  'head  monomial'  and  take  the  sign  of  a  polynomial 
to  be  the  sign  of  the  coeflRcient  of  the  head  monomial.  We  now  show  another  family  of  total 
orderings  based  on  our  polynomial  evaluation  scheme. 
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Let  Q  be  any  ordered  ring  and  let  R  -  Q[xi, . . .  ,!„]  be  the  ring  of  n-variate  polynomials 
with  coefficients  from  Q.  Let  <  be  any  fixed  admissible  ordering  on  the  power  products  PP  = 

PP(Xi,...,X„). 

Lemma  5   Let  u,v,u',v'  £  PP  such  that  either  u  i^  u'  or  v  ^  v' .    If  uv<u'v'  then  u  <  u'  or 

A  -4 

V  <  v' . 

A 

Proof.  Let  the  exponents  of  xj, . .  .,a;n  in  u  and  u'  be  u  =  (ui, . .  .,t^n)  and  u'  =  (uj, . . .,  w'„), 
respectively.  Similarly  for  let  v,  v'  denote  the  exponents  with  respect  to  v,v'.  By  a  character- 
ization of  admissible  orderings  (see  [5]),  there  corresponds  to  <  an  n  by  n  matrix  \V  with  real 

A 
entries  such  that  uv  <  u'v'  iff 

A 

(u  +  v)iy    <   {u'  +  v')W. 
LEX 

Let  y  =  uW,  z  =  vW,  y'  =  uW  and  z'  =  v'W.  Let  the  ith  component  of  y  (resp.  y',  z,  z')  be 

j/,(resp.  yl,  Zi,  r,').  If  i  is  the  first  index  where  either  ?/,  ^  y',  or  z,  ^  z[  then  since 

J/.  +  y[  <Zi^  z\ 
we  must  have  either  i/,-  <  y\  or  Zi  <  z[.  This  means  either  u<u'  or  v  <  v'.  Q.E.D. 

A  A 

We  now  extend  the  sign  function  on  Q  to  a  sign  function  on  i?  =  Q[x\, . . . ,  Xn]  according  to 
our  evaluation  scheme.   Fix  any  point  a  G  Q"  and  any  admissible  ordering  <  on  PP.   For  any 

A 

p  £  R,  vfe  again  have  the  sequence  S{p)  of  its  partial  derivatives  ordered  according  to  <,  and 

A 
also  the  sequence  S{p;  a)  obtained  by  evaluating  S{p)  at  a.   Define  the  sign  cr{p)  of  p  to  be  the 

sign  (since  Q  is  ordered)  of  the  first  non-zero  entry  in  S(p;a).  li  all  entries  in  5(p;a)  are  zero, 

then  a{p)  is  defined  to  be  0. 

Theorem  6  R  —  Q[x\, . . . ,  x„]  with  the  sign  function  a  as  determined  by  the  evaluation  scheme 
is  an  ordered  ring.  Note  that  a  depends  on  the  choice  of  admissible  ordering  <,  on  the  choice  of 

A 

a  €  Q",  and  on  the  sign  function  on  Q. 

Proof.  We  must  verify  the  four  axioms  (A1-A4).  Axiom  (Al)  follows  from  the  fact  that  S{p;  a)  = 
—  5(-p;a),  and  (A4)  comes  from  the  fact  that  5(p;  a)  has  some  non-zero  constant  entry  unless 
p  is  identically  zero.  (A3)  comes  from  the  fact  that  S{p  +  q;  a)  =  S{p;  a)  4-  S{q;  a)  where  we  have 
componentwise  addition  of  the  sequences.  To  show  (A2),  suppose  that  the  first  non-zero  entry 
in  S{p)  (resp.  S{q))  is  p^  (resp.  q^)  where  u,v  £  PP.  We  claim  that  the  first  non-zero  entry  in 
S{pq)  is  {pq)uv  (i-^-  the  partial  derivative  of  p^  with  respect  to  uv).  We  see  that 


{pq)uv  =  ^Pu'^tv 
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where  u'  €  PP  range  over  all  divisors  of  uv  and  v'  is  given  by  u'v'  =  uv.  By  the  previous  lemma, 

unless  u  =  u'  and  v  =  v',  u'v'  <  uv  implies  u>u'  or  v>  v'.    Hence  p^,  =  0  or  p^,  =  0.    We 

A  A  A 

conclude  that  {pq)uv  =  Pulv 

Suppose  uv  >  u'v'.  The  same  argument  as  above  shows  {pq)u'v'  =  0-  It  follows  that  the  sign 

A 
oi  pq  is  equal  to  the  sign  of  pu{a.)q,j{a),  which  is  equal  to  (T{p)a{q).  This  proves  (A2).  Q.E.D. 

As  an  example,  we  may  obtain  as  a  corollary  that  the  scheme  in  [7]  is  an  ordering  on  the  ring 
of  polynomials  since  it  can  been  seen  that  his  choice  of  perturbation  leads  to  the  lexicographical 
ordering  on  PP;  this  observation  does  not  immediately  come  out  in  the  original  paper  because 
the  original  paper  were  restricted  to  evaluating  determinants. 

7      Complexity:  evaluation  of  sparse  polynomials 

Since  all  evaluation  of  polynomials  are  to  be  done  through  our  'black-box',  it  is  important  for  this 
black-box  to  be  efficient.  We  now  consider  this  issue.  Let  L(n,  s,  d)  denote  the  minimum  number 
of  arithmetic  operations  (  +  ,  -,  X,  -f)  sufficient  to  evaluate  aU  polynomials  with  s  monomials  on 
n  variables,  where  each  variable  ha^  degree  at  most  d.  Hence  5  <  c?  -f  1  is  a  measure  of  the 
sparsity  of  the  polynomial  and  we  call  this  the  sparse  complexity  moJe/ of  polynomial  evaluation. 
Little  work  has  been  done  on  this  model  except  in  special  cases.  Nevertheless,  the  model  is  very 
important  especially  in  the  multivariate  setting. 

To  begin,  we  note  that  our  algorithm  need  not  precompute  the  (non-zero  entries  of  the) 
sequence  S{p).  Instead  it  can  compute  successive  entries  on  the  fly.  This  is  because,  using  stan- 
dard admissible  orderings  such  as  total  degree  or  lexicographical  orderings,  it  is  relatively  simple 
to  generate  successive  polynomials  in  the  sequence.  Otherwise,  we  know  that  each  admissible 
ordering  is  characterized  by  a  real  square  matrix  M  and  if  M  has  computable  elements,  we  can 
also  compute  successive  non-zero  entries  of  5(p). 

For  the  evaluation  of  univariate  polynomials,  we  have  relatively  complete  knowledge  about  the 
worst  case  complexity  of  the  two  extremes  of  sparsity:  the  dense  polynomial  (i.e.,  a  polynomial  of 
degree  d  has  d+1  non-zero  coefficients)  and  the  totally  sparse  polynomial  which  consists  of  only 
one  monomial  (i.e.,  the  addition  chain  problem).  From  [19,16],  we  may  deduce  that  evaluating 
an  n-variate  polynomial  with  5  monomials,  where  the  degree  in  each  variables  is  less  than  d  has 
complexity  at  most 

w  +  v\ogd+ H/logH +  o{n/\ogn)  (1) 

where  H  =  ns log d  and  v  =  min{5,n},  w  =  max{5,ri}.  This  bound  is  tight  if  we  insist 
on  evaluating  the  monomials  as  separate  entities.  For  evaluating  polynomials,  this  is  a  real 
restriction.  For  instance,  evaluating,  (xi  -f  i2)(3;3  +  X4)  as  a  polynomial  requires  strictly  fewer 
arithmetic  operations  than  evaluating  the  four  component  monomials  separately. 
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In  our  applications,  we  not  only  want  to  evaluate  a  polynomial  but  may  need  to  evaluate 
some  of  its  derivatives  as  well.  There  is  literature  on  evaluating  a  dense  univariate  polynomial 
and  all  of  its  derivatives  (e.g.  [l]).  They  are  not  directly  suitable  for  our  application  since  we 
need  sparse  multivariate  polynomials,  and  generally  do  not  evaluate  derivatives  of  every  order. 
(See  also  [12,13].)  The  sparse  model  of  polynomial  evaluation  lends  itself  nicely  to  the  derivative 
evaluation  problem  since  a  derivative  polynomial  is  at  least  as  sparse  as  the  original  polynomial. 

Instead  of  using  the  asymptotically  optimal  method  of  Pippenger,  we  use  a  simpler  scheme 
based  on  Yao's  method.  The  method  basically  says  that  given  x  raised  to  each  power  of  2  less 
than  d,  we  can  evaluate  x'^  is  iJ°fo'^j[l  +  o(l)]  steps.  Hence  any  5  term  n-variate  polynomial 
can  be  evaluated  in  i^^°f''i  [1  +  0(1)]  steps,  (assuming  an  initial  cost  of  n\ogd  to  evaluate  each 
variable  raised  to  powers  of  2  less  than  d.)  For  our  application,  the  advantage  of  this  simple 
scheme  comes  from  the  potential  need  to  evaluate  derivatives:  each  subsequent  derivative  can 
be  evaluated  using  the  same  method.  The  space  usage  is  0(n\ogd)  to  store  the  variables  raised 
to  powers  of  2.  For  moderate  values  of  5,  this  space  can  be  improved  without  loss  of  speed: 
by  evaluating  aU  the  necessary  powers  of  one  variable  before  proceeding  to  the  next  variable, 
and  accumulating  them  in  each  of  the  >  'partially  formed'  monomials,  we  reduce  the  space  to 
0(5  + log  d). 

In  the  univariate  case,  we  can  actually  get  a  uniform  method  of  polynomial  evaluation  whose 
complexity  is  optimal  (up  to  smaller  order  terms)  for  the  entire  range  of  s: 

I(l,3,cf)<l0gd+/yf/;\  [1  +  0(1)] 

loglog(d/5) 

To  see  this,  we  use  the  'generalized  Horner  factorization'  of  a  polynomial: 

p{x)  =  aix'''  (1  +  Cix'^ni  +  •••(!  +  ay'). . .)) 

where  5Z*=i  di  <  d.  We  first  evaluate  x^  where  k  <  d  range  over  powers  of  2.  Now  evaluate  each 
i'^'  in  lo  "fo  /j  \  [1  +  0(1)]  steps.  Summing  over  all  i  =  1, . . . ,  s,  and  observing  that  the  complexity 
is  maximized  when  each  d,-  =  dfs,  we  get  our  result.  Unfortunately,  this  method  does  not  seem 
to  generalize  to  multivariate  polynomials. 

An  important  observation  is  that  one  is  unlikely  to  have  to  evaluate  more  than  the  first  few 
polynomials  in  the  sequence  before  turning  up  a  non-zero  value.  The  small  probability  involved 
here  seems  to  be  related  to  the  small  probability  of  residual  degeneracies  in  probabilistic  scheme. 

At  a  small  extra  cost  in  space,  we  can  speed  up  the  evaluation  of  subsequent  entries  in  S{p). 
This  is  based  on  the  observation  that  evaluating  successive  polynomials  in  the  sequence  S{p)  are 
closely  related.  The  idea  is  that  in  the  above  method  for  evaluating  the  monomial  xf^X2^  ■  •  -x^", 
we  keep  around  the  values  of  x,'  for  each  i.  Then  in  the  derivative  (say  with  respect  to  Xi),  we 
must  evaluate  the  monomial  Xj^^^Xj^  •  •  -x^"  and  this  can  be  obtained  in  n  +  log  di  /  log  log  rfi 
multiplications  (instead  of  nlogdi/ loglogdi).  Other  methods  to  trade  off  time  for  space  can 
also  be  devised. 
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8     Directions  for  future  work 

1.  The  black-box  we  provide  is  not  invariant  under  change  of  coordinate  axes.  For  example, 
with  f{x,y)  =  X  and  g{x,y)  =  y  our  test  yields  the  inequality  /(0,0)  >  p(0,0)  (assuming  the 
ordering  x  >  y).  But  if  we  rotate  the  coordinate  axes  of  the  plane  by  angle  6,  the  transformed 
functions  become  F{x,  y)  =  x  cos  0  -  ys'md  and  G{x,  y)  =  x  s'lnB  +  y  cos  6.  If  |^|  >  7r/2  then  we 
get  ^(0,0)  <  G(0,0),  which  is  opposite  to  the  unrotated  case. 

2.  We  have  defined  our  scheme  for  evaluation  of  polynomials.  It  is  easy  to  extend  this  to 
rational  polynomial  functions,  /(x)  =  p(x)/9(x).  Such  rational  functions  might  arise  in,  say  the 
solution  of  linear  systems  (using  Cramer's  rule).  We  form  the  sequence  5(/)  =  (/i,/2,..-)  as 
usual.  It  is  not  hard  to  check  that  again,  this  sequence  has  a  finite  number  of  non-zero  entries, 
and  there  are  constant  entries  among  them.  Hence  the  'first  non-zero  element"  iu  5(/;a)  is 
well-defined.  Now  the  ordered  rings  become  ordered  fields. 

3.  What  about  evaluation  of  algebraic  functions?  By  an  algebraic  function  we  understand  a 
continuous  function  r(x)  such  that  for  some  polynomial  p(z;x), 

p(r(x);x)  =  0. 

For  each  x:=a,  we  get  an  algebraic  number  r(a).  (As  discussed  in  section  2,  such  algebraic 
numbers  are  represented  exactly.)  Perhaps  the  most  common  instance  is  the  square-root  function 
r(x)  =  y/xj,  where  p(z;x)  =  z"^  —  Xi.  If  we  want  to  compare  the  distance  d{p,q)  between 
two  points  p,  q  with  some  value  v,  then  we  could  clear  square-roots  and  set  up  the  test  as 
d{p-iiy  —  v^  =  0.  One  cannot  do  such  transformation  of  the  test  polynomial  with  impunity 
because  they  affect  the  infinitesimals.  For  example,  if  /(i)  =  i^,  F{x)  =  /(i)^  =  x'*  and 
g{x)  =  x^  then  we  have  /(O)  >  ^(0)  >  F{0)  assuming  total  degree  admissible  ordering.  Hence, 
the  comparison  /(O)  :  ^(0)  is  not  equivalent  to  F{0)  :  g{0)  (though  it  would  be  equivalent  to 
F(0)  :  G(0)  where  G  =  g^).  To  treat  algebraic  functions  in  general,  we  can  proceed  as  follows: 
suppose  r(x)  is  an  algebraic  function  satisfying  p(r(x);x)  =  0.  Then  the  partial  derivatives  of 
r(x)  is  obtained  by  the  chain  rule,  and  we  can  define  the  sequences  S{r)  and  S{r;  a)  in  the  usual 

way.  For  instance,  in  the  case  of  the  square  root  function,  -^^  =  l/2r(x)and  —^^  =  -l/r(x)^. 
The  computational  details  of  carrying  out  such  a  scheme  especially  if  the  degree  of  derivation  is 
unbounded  are  nontrivial. 

4.  In  some  applications,  there  are  restrictions  on  the  perturbation.  In  some  hidden  surface 
removal  algorithms,  there  is  a  basic  test  to  see  if  two  triangles  A,B  obscure  one  another.  Let 
the  polygon  C  be  the  intersection  of  the  projections  A',  B'  of  the  triangles  onto  the  viewplane 
(ly-plane).  So  C  has  at  most  6  sides.  Take  any  vertex  P  in  C.  By  projecting  P  back  to  A  and 
B,  we  get  Pa  and  Pb-  The  test  reduces  to  comparing  the  2:-coordinate  of  Pa  and  of  Pg.  K  their 
^-coordinates  are  equal,  we  want  to  do  the  same  test  on  a  perturbed  version  of  P.  We  must 
ensure  that  perturbed  P  does  not  go  outside  C:  conceptually,  we  add  to  P  an  infinitesimal  vector 
directed  to  the  interior  of  C .  Unfortunately,  our  present  perturbation  scheme  is  not  helpful  here. 
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5.  Another  more  far  reaching  direction  is  to  to  handling  uncertainty  in  the  input  data.  This 
is  related  to  the  issue  of  treating  finite  precision  arithmetic.  A  basic  question  to  be  faced  is 
this:  what  is  the  meaning  of  degeneracy  in  the  world  of  imprecise  input  or  in  the  world  of  finite 
precision?  This  is  a  difficult  question,  related,  for  instance,  to  the  problem  defining  the  digital 
analogues  of  lines  and  circles. 

6.  Computational  experience  and  applications  to  actual  algorithms  in  computational  geom- 
etry ought  lo  be  carried  out  to  expose  other  issues  in  implementing  our  scheme:  for  instance, 
what  is  a  good  choice  of  admissible  ordering? 

7.  Other  connections  that  could  be  investigated:  work  on  numerical  stability  and  geometric 
probability  [3,4],  singularity  and  catastrophe  theory  [17].  Our  method  is  not  directly  related  to 
symmetry  breaking  rules  in  the  Simplex-based  algorithms  (see  [2]),  but  the  connection  is  not 
fully  explored. 
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